Chinese Word Sense Disambiguation Based on Lexical Semantic Ontology

Authors

  • Li Li
  • Qiang Zhou
Abstract

This paper describes preliminary work on word sense disambiguation for Chinese verbs using information derived from a lexical semantic ontology (LSO). Instead of sophisticated methods, a simple algorithm is employed to highlight the characteristics of the features chosen from the LSO data. Several groups of tests are designed to examine the effects of the different features and of other factors. Preliminary tests on nine ambiguous Chinese verbs yield some promising results. The results show which informative features the LSO provides and suggest potential ways to improve.

Similar resources

Sense Extraction and Disambiguation for Chinese Words from Bilingual Terminology Bank

Using lexical semantic knowledge to solve natural language processing problems has been getting popular in recent years. Because semantic processing relies heavily on lexical semantic knowledge, the construction of lexical semantic databases has become urgent. WordNet is the most famous English semantic knowledge database at present; many researches of word sense disambiguation adopt it as a st...

A Chinese Corpus with Word Sense Annotation

This paper presents the construction of a Chinese word sense-tagged corpus. The resulting lexical resource includes mainly three components: 1) a corpus annotated with word senses; 2) a lexicon containing sense distinction and description in the feature-based formalism; 3) the linking between the sense entries in the lexicon and CCD synsets. A dynamic model is put forward to build the three kno...

A Novel Method of Text Clustering for Chinese Spam Based on Semantic Body

The effect of spam filtering method based on statistics is not good in filtering the new-type spam with synonymous substitution and camouflage. So a new text clustering method based on Semantic Body for filtering Chinese spam is proposed. In this paper, the word sense disambiguation, lexical chain based on HowNet and statistic-based TFIDF are adopted to extract features of mails. The Semantic B...

A Large-scale Lexical Semantic Knowledge-base of Chinese

The Semantic Knowledge-base of Contemporary Chinese (SKCC) is a large scale Chinese semantic resource developed by the Institute of Computational Linguistics of Peking University. It provides a large amount of semantic information such as semantic hierarchy and collocation features for 66,539 Chinese words and their English counterparts. Its POS and semantic classification represent the latest ...

Ontology-Based Word Sense Disambiguation in Parallel Corpora

Lately, there seems to be a growing acceptance of the idea that multilingual lexical ontologies might be the key towards aligning different views on the semantic atomic units to be used in characterizing the general meaning of various and multilingual documents. Comparing performances of word sense disambiguation systems is a difficult evaluation task when different sense inventories are used a...

Journal:
  • Journal of Chinese Language and Computing

Volume 18

Publication date: 2008